Experiences with selecting search engines using metasearch
Daniel Dreilinger, Adele E. Howe 

July 1997 ACM Transactions on Information Systems (TOIS), volume 15 issue 3
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms, review 



Full text available: pdf (428.65 KB)



Search engines are among the most useful and high-profile resources on the Internet. The 
problem of finding information on the Internet has been replaced with the problem of 
knowing where search engines are, what they are designed to retrieve, and how to use 
them. This article describes and evaluates SavvySearch, a metasearch engine designed to 
intelligently select and interface with multiple remote search engines. The primary 
metasearch issue examined is the importance of carefully selecti ... 

Keywords: WWW, information retrieval, machine learning, search engine 



SDLIP + STARTS = SDARTS: a protocol and toolkit for metasearching
Noah Green, Panagiotis G. Ipeirotis, Luis Gravano 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital 
libraries JCDL '01 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings, index 
terms 



Full text available: pdf (301.52 KB)



In this paper we describe how we combined SDLIP and STARTS, two complementary
protocols for searching over distributed document collections. The resulting protocol, which 
we call SDARTS, is simple yet expressible enough to enable building sophisticated 
metasearch engines. SDARTS can be viewed as an instantiation of SDLIP with metasearch- 
specific elements from STARTS. We also report on our experience building three SDARTS- 
compliant wrappers: for locally available plain-text document collect ... 

http://portal.acm.org/results.c^ 11/29/07

Results (page 1): + M metasearch" ^translate +format +native -t-search

Predicate rewriting for translating Boolean queries in a heterogeneous information
system

Chen-Chuan K. Chang, Hector Garcia-Molina, Andreas Paepcke

January 1999 ACM Transactions on Information Systems (TOIS), volume 17 issue 1
Publisher: ACM Press

Additional Information: full citation, abstract, references, citings, index terms
Full text available: pdf (350.96 KB)

Searching over heterogeneous information sources is difficult in part because of the 
nonuniform query languages. Our approach is to allow users to compose Boolean queries 
in one rich front-end language. For each user query and target source, we transform the 
user query into a subsuming query that can be supported by the source but that may 
return extra documents. The results are then processed by a filter query to yield the 
correct final results. In this article we introduce the architectur ... 



Keywords: Boolean queries, content-based retrieval, filtering, predicate rewriting, query 
subsumption, query translation 



STARTS: Stanford proposal for Internet meta-searching

Luis Gravano, Chen-Chuan K. Chang, Hector Garcia-Molina, Andreas Paepcke

June 1997 ACM SIGMOD Record, Proceedings of the 1997 ACM SIGMOD international
conference on Management of data SIGMOD '97, volume 26 issue 2
Publisher: ACM Press

Full text available: pdf (1.53 MB) Additional Information: full citation, abstract, references, citings, index terms

Document sources are available everywhere, both within the internal networks of 
organizations and on the Internet. Even individual organizations use search engines from 
different vendors to index their internal document collections. These search engines are 
typically incompatible in that they support different query models and interfaces, they do 
not return enough information with the query results for adequate merging of the results, 
and finally, in that they do not export metadata about t ... 

Summary in context: Searching versus browsing
Daniel M. McDonald, Hsinchun Chen

January 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 1
Publisher: ACM Press

Full text available: pdf (530.99 KB) Additional Information: full citation, abstract, references, index terms

The use of text summaries in information-seeking research has focused on query-based 
summaries. Extracting content that resembles the query alone, however, ignores the 
greater context of the document. Such context may be central to the purpose and 
meaning of the document. We developed a generic, a query-based, and a hybrid 
summarizer, each with differing amounts of document context. The generic summarizer 
used a blend of discourse information and information obtained through traditional 
surface- ... 

Keywords: Summarization, browse, generic summaries, indicative summaries, 
information seeking, natural language processing, search, text processing 



6 Digital libraries for spatial data: The ADEPT digital library architecture
Greg Janee, James Frew

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
JCDL '02

Publisher: ACM Press

Full text available: pdf (263.61 KB) Additional Information: full citation, abstract, references, citings, index terms

The Alexandria Digital Earth ProtoType (ADEPT) architecture is a framework for building 
distributed digital libraries of georeferenced information. An ADEPT system comprises one 
or more autonomous libraries, each of which provides a uniform interface to one or more 
collections, each of which manages metadata for one or more items. The primary standard 
on which the architecture is based is the ADEPT bucket framework, which defines uniform
client-level metadata query services that are compatible w ... 

Keywords: bucket framework, collection discovery, distribution, interoperability, 
metadata 



7 Session 1: Leveraging semantic technologies for enterprise search
Gianluca Demartini 

November 2007 Proceedings of the ACM first Ph.D. workshop in CIKM PIKM '07 
Publisher: ACM 

Full text available: pdf (184.47 KB) Additional Information: full citation, abstract, references, index terms

Enterprise search is very different from Web search (for example in the link structure or in 
the user's needs and goal) and some steps have been already done to exploit these
differences in order to improve the effectiveness of enterprise search. In this paper we 
present the state of the art of the enterprise search field with some open issues. We also 
present a research plan that aims at using Information Retrieval, Semantic Web, and User 
Modelling techniques to cope with these issues improvi ... 

Keywords: enterprise search, evaluation, expert search, personalization 



Early user-system interaction for database selection in massive domain-specific
online environments

Jack G. Conrad, Joanne R. S. Claussen 

January 2003 ACM Transactions on Information Systems (TOIS), volume 21 issue 1 
Publisher: ACM Press 

Full text available: pdf (845.54 KB) Additional Information: full citation, abstract, references, index terms

The continued growth of very large data environments such as Westlaw and Dialog, in 
addition to the World Wide Web, increases the importance of effective and efficient 
database selection and searching. Current research focuses largely on completely 
autonomous and automatic selection, searching, and results merging in distributed 
environments. This fully automatic approach has significant deficiencies, including reliance 
upon thresholds below which databases with relevant documents are not search ... 

Keywords: Database selection, metadata for retrieval, structuring information to aid 
search and navigation, user interaction 



Summarization and question answering: Using librarian techniques in automatic text
summarization for information retrieval
Min-Yen Kan, Judith L. Klavans 

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries 
JCDL '02 

Publisher: ACM Press 

Full text available: pdf (1.15 MB) Additional Information: full citation, abstract, references, citings, index terms

A current application of automatic text summarization is to provide an overview of relevant 
documents coming from an information retrieval (IR) system. This paper examines how 
Centrifuser, one such summarization system, was designed with respect to methods used 
in the library community. We have reviewed these librarian expert techniques to assist 
information seekers and codified them into eight distinct strategies. We detail how we 
have operationalized six of these strategies in Centrifuser by c ... 
Predicate rewriting for translating Boolean queries in a heterogeneous information
system

Chen-Chuan K. Chang, Hector Garcia-Molina, Andreas Paepcke

January 1999 ACM Transactions on Information Systems (TOIS), volume 17 issue 1

Publisher: ACM Press

Additional Information: full citation, abstract, references, citings, index terms

Full text available: pdf (350.96 KB)



Searching over heterogeneous information sources is difficult in part because of the 
nonuniform query languages. Our approach is to allow users to compose Boolean queries 
in one rich front-end language. For each user query and target source, we transform the 
user query into a subsuming query that can be supported by the source but that may 
return extra documents. The results are then processed by a filter query to yield the 
correct final results. In this article we introduce the architectur ... 

Keywords: Boolean queries, content-based retrieval, filtering, predicate rewriting, query 
subsumption, query translation 



Experiences with selecting search engines using metasearch
Daniel Dreilinger, Adele E. Howe

July 1997 ACM Transactions on Information Systems (TOIS), volume 15 issue 3
Publisher: ACM Press

Additional Information: full citation, abstract, references, citings, index terms, review

Full text available: pdf (428.65 KB)



Search engines are among the most useful and high-profile resources on the Internet. The 
problem of finding information on the Internet has been replaced with the problem of 
knowing where search engines are, what they are designed to retrieve, and how to use 
them. This article describes and evaluates SavvySearch, a metasearch engine designed to 
intelligently select and interface with multiple remote search engines. The primary 
metasearch issue examined is the importance of carefully selecti ... 

Keywords: WWW, information retrieval, machine learning, search engine 

Thomas Kunz, Michiel F. H. Seuren

November 1997 Proceedings of the 1997 conference of the Centre for Advanced
Studies on Collaborative research CASCON '97 

Publisher: IBM Press 

Full text available: pdf (4.21 MB) Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the execution 
of the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not 
provide the user with the desired overview of the application. In our experience, such tools 
display repeated occurrences of non-trivial commun ... 

4 Embedding web-based statistical translation models in cross-language information
retrieval

Wessel Kraaij, Jian-Yun Nie, Michel Simard 

September 2003 Computational Linguistics, volume 29 issue 3 

Publisher: MIT Press 

Full text available: pdf (381.29 KB) Additional Information: full citation, abstract, references, citings, index terms

Although more and more language pairs are covered by machine translation (MT) services, 
there are still many pairs that lack translation resources. Cross-language information 
retrieval (CLIR) is an application that needs translation functionality of a relatively low 
level of sophistication, since current models for information retrieval (IR) are still based on 
a bag of words. The Web provides a vast resource for the automatic construction of 
parallel corpora that can be used to train statistical ... 

5 Human-computer interaction: Analysis of user attitude and behaviour in evaluating a
personalized search engine
Effie Lai-Chong Law, Tomaz Klobucar, Matic Pipan

September 2006 Proceedings of the 13th European conference on Cognitive
ergonomics: trust and control in complex socio-technical systems
ECCE '06

Publisher: ACM Press

Full text available: pdf (508.34 KB) Additional Information: full citation, abstract, references

This paper reports an empirical work on user-based relevance evaluation of a personalized 
search engine (PSE). The aim of the work is threefold: To develop metrics for evaluating 
PSE; to study how users' trust in personalized information retrieval systems influences 
their relevance judgments; to identify patterns of relevance criteria applications. Our 
findings corroborate some of those of the previous work and reveal some new 
phenomena: the optimality of sample size, consistency of relevance ... 

Keywords: information entropy, information retrieval, monte-carlo simulation, 
personalization, relevance criteria 



STARTS: Stanford proposal for Internet meta-searching

Luis Gravano, Chen-Chuan K. Chang, Hector Garcia-Molina, Andreas Paepcke

June 1997 ACM SIGMOD Record, Proceedings of the 1997 ACM SIGMOD international
conference on Management of data SIGMOD '97, volume 26 issue 2
Publisher: ACM Press

Full text available: pdf (1.53 MB) Additional Information: full citation, abstract, references, citings, index terms

Document sources are available everywhere, both within the internal networks of
organizations and on the Internet. Even individual organizations use search engines from
different vendors to index their internal document collections. These search engines are
typically incompatible in that they support different query models and interfaces, they do 
not return enough information with the query results for adequate merging of the results, 
and finally, in that they do not export metadata about t ... 

Making MIRACLEs: Interactive translingual search for Cebuano and Hindi
Daqing He, Douglas W. Oard, Jianqiang Wang, Jun Luo, Dina Demner-Fushman, Kareem 
Darwish, Philip Resnik, Sanjeev Khudanpur, Michael Nossal, Michael Subotin, Anton Leuski 
September 2003 ACM Transactions on Asian Language Information Processing 

(TALIP), Volume 2 Issue 3 
Publisher: ACM Press 

Full text available: pdf (209.29 KB) Additional Information: full citation, abstract, references, index terms

Searching is inherently a user-centered process; people pose the questions for which 
machines seek answers, and ultimately people judge the degree to which retrieved 
documents meet their needs. Rapid development of interactive systems that use queries 
expressed in one language to search documents written in another poses five key 
challenges: (1) interaction design, (2) query formulation, (3) cross-language search, (4) 
construction of translated summaries, and (5) machine translation. This articl ... 

Keywords: Cross-language information retrieval, Interactive information retrieval, 
Machine translation 



SDLIP + STARTS = SDARTS: a protocol and toolkit for metasearching
Noah Green, Panagiotis G. Ipeirotis, Luis Gravano 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital 
libraries JCDL '01 

Publisher: ACM Press 

Additional Information: full citation, abstract, references, citings, index terms

Full text available: pdf (301.52 KB)

In this paper we describe how we combined SDLIP and STARTS, two complementary
protocols for searching over distributed document collections. The resulting protocol, which 
we call SDARTS, is simple yet expressible enough to enable building sophisticated 
metasearch engines. SDARTS can be viewed as an instantiation of SDLIP with metasearch- 
specific elements from STARTS. We also report on our experience building three SDARTS- 
compliant wrappers: for locally available plain-text document collect ... 

A mediation infrastructure for digital library services
Sergey Melnik, Hector Garcia-Molina, Andreas Paepcke

June 2000 Proceedings of the fifth ACM conference on Digital libraries DL '00

Publisher: ACM Press

Full text available: pdf (155.30 KB) Additional Information: full citation, abstract, references, citings, index terms

Digital library mediators allow interoperation between diverse information services. In this 
paper we describe a flexible and dynamic mediator infrastructure that allows mediators to 
be composed from a set of modules ("blades"). Each module implements a particular
mediation function, such as protocol translation, query translation, or result merging. All 
the information used by the mediator, including the mediator logic itself, is represented by 
an RDF graph. We i ... 

Keywords: component design, interoperability, mediator, wrapper 

Metadata for digital libraries: architecture and design rationale
Michelle Baldonado, Chen-Chuan K. Chang, Luis Gravano, Andreas Paepcke

July 1997 Proceedings of the second ACM international conference on Digital
libraries DL '97
Publisher: ACM Press

Full text available: pdf (1.65 MB) Additional Information: full citation, references, citings, index terms



Keywords: CORBA, InfoBus, attrabute model translation, attribute model translation, 
digital libraries, heterogeneity, interoperability, metadata architecture, metadata 
repository, proxy architecture 



11 Tools and approaches for developing data-intensive Web applications: a survey
Piero Fraternali

September 1999 ACM Computing Surveys (CSUR), volume 31 issue 3
Publisher: ACM Press

Full text available: pdf (524.80 KB) Additional Information: full citation, abstract, references, citings, index terms

The exponential growth and capillar diffusion of the Web are nurturing a novel generation 
of applications, characterized by a direct business-to-customer relationship. The 
development of such applications is a hybrid between traditional IS development and 
Hypermedia authoring, and challenges the existing tools and approaches for software 
production. This paper investigates the current situation of Web development tools, both 
in the commercial and research fields, by identifying and characte ... 

Keywords: HTML, Intranet, WWW, application, development 



12 An extensible constructor tool for the rapid, interactive design of query synthesizers
Michelle Baldonado, Seth Katz, Andreas Paepcke, Chen-Chuan Chang, Hector Garcia-Molina,
Terry Winograd

May 1998 Proceedings of the third ACM conference on Digital libraries DL '98
Publisher: ACM Press

Full text available: pdf (1.75 MB) Additional Information: full citation, references, citings, index terms



13 An analysis of XML database solutions for the management of MPEG-7 media
descriptions
Utz Westermann, Wolfgang Klas

December 2003 ACM Computing Surveys (CSUR), volume 35 issue 4

Publisher: ACM Press

Full text available: pdf (448.76 KB) Additional Information: full citation, abstract, references, citings, index terms, review

MPEG-7 constitutes a promising standard for the description of multimedia content. It can 
be expected that a lot of applications based on MPEG-7 media descriptions will be set up in 
the near future. Therefore, means for the adequate management of large amounts of 
MPEG-7-compliant media descriptions are certainly desirable. Essentially, MPEG-7 media 
descriptions are XML documents following media description schemes defined with a 
variant of XML Schema. Thus, it is reasonable to investigate curren ... 

Keywords: MPEG-7, XML database systems, multimedia databases 

14 Searching and information extracting: Multimedia information services enabling: an
architectural approach

Erik Boertjes, Willem Jonker, Jeroen Wijnands

September 2001 Proceedings of the 2001 ACM workshops on Multimedia: multimedia
information retrieval MULTIMEDIA '01

Publisher: ACM Press

Full text available: pdf (599.94 KB) Additional Information: full citation, abstract, references, index terms

This paper presents a scalable and extendable architecture consisting of the essential 
building blocks for multimedia information services. It provides building blocks for 
multimedia transport, storage, retrieval, filtering, and presentation, together with their 
interdependences. After presenting the overall architecture, we focus in more detail on 
the 3-level modeling and querying of multimedia data. Emphasis is placed on the support 
for a wide variety of modeling and querying techniques in th ... 

Keywords: information management, metadata management, multimedia search, 
multimedia services, platform architectures, query processing 



15 Summary in context: Searching versus browsing
Daniel M. McDonald, Hsinchun Chen

January 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 1
Publisher: ACM Press

Full text available: pdf (530.99 KB) Additional Information: full citation, abstract, references, index terms

The use of text summaries in information-seeking research has focused on query-based 
summaries. Extracting content that resembles the query alone, however, ignores the 
greater context of the document. Such context may be central to the purpose and 
meaning of the document. We developed a generic, a query-based, and a hybrid 
summarizer, each with differing amounts of document context. The generic summarizer 
used a blend of discourse information and information obtained through traditional
surface- ... 

Keywords: Summarization, browse, generic summaries, indicative summaries, 
information seeking, natural language processing, search, text processing 



16 Web-based specification and integration of legacy services
Ying Zou, Kostas Kontogiannis 

November 2000 Proceedings of the 2000 conference of the Centre for Advanced 
Studies on Collaborative research CASCON '00 

Publisher: IBM Press 

Full text available: pdf (279.28 KB) Additional Information: full citation, abstract, references, index terms

With the explosive growth of the Internet, businesses of all sizes aim on applying 
networkwide solutions to their IT infrastructures, migrating their legacy business 
processes into web-based environments, and establishing their own on-line services. To 
facilitate process and service integration, a complete and information rich service 
description language, is essential for server processes to be specified and for client 
processes to be able to locate services that are available in Web-enabled re ... 

17 Shallow NLP techniques for internet search 
Alex Penev, Raymond Wong 

January 2006 Proceedings of the 29th Australasian Computer Science Conference - 
Volume 48 ACSC '06 

Publisher: Australian Computer Society, Inc. 

Full text available: Additional Information: 

Information Retrieval (IR) is a major component in many of our daily activities, with
perhaps its most prominent role manifested in search engines. Today's most advanced 
engines use the keyword-based ("bag of words") paradigm, which concedes some inherent 
disadvantages. We believe that natural language (NL) is a more user-oriented, context- 
preservative and intuitive mechanism for web search. In this paper, we explore shallow 
NLP techniques to support a range of NL queries over an existing keyword ... 

Keywords: google, information retrieval, natural language, processing 



18 Linguistic resource creation for research and technology development: A recent
experiment

Stephanie Strassel, Mike Maxwell, Christopher Cieri

June 2003 ACM Transactions on Asian Language Information Processing (TALIP),
Volume 2 Issue 2
Publisher: ACM Press

Full text available: pdf (186.39 KB) Additional Information: full citation, abstract, references, citings, index terms

Advances in statistical machine learning encourage language-independent approaches to 
linguistic technology development. Experiments in "porting" technologies to handle new 
natural languages have revealed a great potential for multilingual computing, but also a 
frustrating lack of linguistic resources for most languages. Recent efforts to address the 
lack of available resources have focused either on intensive resource development for a 
small number of languages or development of technologies fo ... 

Keywords: Cebuano, Hindi, Machine translation, crosslanguage, information extraction, 
information retrieval, language parsing and understanding, linguistic resources, machine 
translation, summarization, text analysis, translingual information access technology 



19 Digital library communities and change: Cross-cultural usability of the library
metaphor
Elke Duncker

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
JCDL '02

Publisher: ACM Press

Full text available: pdf (202.45 KB) Additional Information: full citation, abstract, references, citings, index terms

Computing metaphors have become an integral part of information systems design, yet 
they are deeply rooted in cultural practices. This paper presents an investigation of the 
cross-cultural use and usability of such metaphors by studying the library metaphor of 
digital libraries in the cultural context of the Maori, the indigenous population of New 
Zealand. The ethnographic study examines relevant features of the Maori culture, their 
form of knowledge transfer and their use of physical and digita ... 

Keywords: computing metaphor, cross-cultural usability, digital library, globalization, 
indigenous culture, localization 



20 Cross-language: Bootstrapping dictionaries for cross-language information retrieval
Kornel Marko, Stefan Schulz, Olena Medelyan, Udo Hahn 

August 2005 Proceedings of the 28th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '05 

Publisher: ACM Press 

The bottleneck for dictionary-based cross-language information retrieval is the lack of
comprehensive dictionaries, in particular for many different languages. We here introduce 
a methodology by which multilingual dictionaries (for Spanish and Swedish) emerge 
automatically from simple seed lexicons. These seed lexicons are automatically generated, 
by cognate mapping, from (previously manually constructed) Portuguese and German as 
well as English sources. Lexical and semantic hypotheses are then ... 

Keywords: cross-language information retrieval, lexical acquisition 

Additional Information: full citation, abstract, references, citings, index terms
Full text available: pdf (155.30 KB)



Digital library mediators allow interoperation between diverse information services. In this 
paper we describe a flexible and dynamic mediator infrastructure that allows mediators to 
be composed from a set of modules ("blades"). Each module implements a particular
mediation function, such as protocol translation, query translation, or result merging. All 
the information used by the mediator, including the mediator logic itself, is represented by 
an RDF graph. We i ... 

Keywords: component design, interoperability, mediator, wrapper 



Combinators for bidirectional tree transformations: A linguistic approach to the
view-update problem

J. Nathan Foster, Michael B. Greenwald, Jonathan T. Moore, Benjamin C. Pierce, Alan Schmitt
May 2007 ACM Transactions on Programming Languages and Systems (TOPLAS),
Volume 29 Issue 3
Publisher: ACM Press

Additional Information: full citation, appendices and supplements, abstract, references, index terms

Full text available: pdf (1.06 MB)



We propose a novel approach to the view-update problem for tree-structured data: a 
domain-specific programming language in which all expressions denote bidirectional 
transformations on trees. In one direction, these transformations— dubbed lenses— map a 
concrete tree into a simplified abstract view; in the other, they map a modified abstract 
view, together with the original concrete tree, to a correspondingly modified concrete tree. 
Our design emphasizes both robustness and ea ... 

Keywords: Bidirectional programming, Harmony, XML, lenses, view update problem 

21 Hypermedia and Graphics 2: Vector graphics: from PostScript and Flash to SVG
Steve Probets, Julius Mong, David Evans, David Brailsford 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 
DocEng '01 

Publisher: ACM Press 

Additional Information: full citation, abstract, references, citings, index terms

Full text available: pdf (127.00 KB)



The XML-based specification for Scalable Vector Graphics (SVG), sponsored by the World
Wide Web consortium, allows for compact and descriptive vector graphics for the Web. This
paper describes a set of three tools for creating SVG, either from first principles or via the 
conversion of existing formats. The ab initio generation of SVG is effected from a server- 
side CGI script, using a PERL library of drawing functions; later sections highlight the 
problems of converting Adobe PostScript and ... 



Keywords: Flash, PDF, PostScript, SVG, SWF 



22 Supporting education: Metadata aggregation and "automated digital libraries": a
retrospective on the NSDL experience

Carl Lagoze, Dean Krafft, Tim Cornwell, Naomi Dushay, Dean Eckstrom, John Saylor
June 2006 Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
JCDL '06
Publisher: ACM Press

Full text available: pdf (346.87 KB) Additional Information: full citation, abstract, references, index terms

Over three years ago, the Core Integration team of the National Science Digital Library 
(NSDL) implemented a digital library based on metadata aggregation using Dublin Core 
and OAI-PMH. The initial expectation was that such low-barrier technologies would be 
relatively easy to automate and administer. While this architectural choice permitted rapid 
deployment of a production NSDL, our three years of experience have contradicted our 
original expectations of easy automation and low people cost. We ... 

Keywords: NSDL, OAI-PMH, architecture, interoperability, metadata 

August 2002 Proceedings of the 28th international conference on Very Large Data
Bases - Volume 28 VLDB '2002 

Publisher: VLDB Endowment 

Full text available: pdf (287.29 KB) Additional Information: full citation, abstract, references, index terms

In spite of the many decades of progress in database research, surprisingly scientists in 
the life sciences community still struggle with inefficient and awkward tools for querying 
biological data sets. This work highlights a specific problem involving searching large 
volumes of protein data sets based on their secondary structure. In this paper we define 
an intuitive query language that can be used to express queries on secondary structure 
and develop several algorithms for evaluating these ... 

24 Predicate-calculus-based logics for modeling and solving search problems 
Deborah East, Miroslaw Truszczynski 

January 2006 ACM Transactions on Computational Logic (TOCL), volume 7 issue 1
Publisher: ACM Press 

Full text available: pdf(300.38 KB) Additional Information: full citation , abstract , references , index terms 

The answer-set programming (ASP) paradigm is a way of using logic to solve search 
problems. Given a search problem, to solve it one designs a logic theory so that models of 
this theory represent problem solutions. To compute a solution to the problem, one 
computes a model of the theory. Several answer-set programming formalisms have been 
developed on the basis of logic programming with the semantics of answer sets. In this 
article we show that predicate logic also gives rise to effective ... 

Keywords: Satisfiability, constraints, predicate logic, pseudo-Boolean constraints, search 
problems 



25 Bioinformatics (BIO): An architecture for biological information extraction and
representation

Aditya Vailaya, Peter Bluvas, Robert Kincaid, Allan Kuchinsky, Michael Creech, Annette Adler
March 2004 Proceedings of the 2004 ACM symposium on Applied computing SAC '04 

Publisher: ACM Press 

Full text available: pdf(355.71 KB) Additional Information: full citation, abstract, references

Technological advances in biomedical research are generating a plethora of heterogeneous 
data at a high rate. There is a critical need for extraction, integration and management 
tools for information discovery and synthesis from these heterogeneous data. In this 
paper, we present a general architecture, called ALFA, for information extraction and 
representation from diverse biological data. The ALFA architecture consists of: (i) a 
networked, hierarchical object model for representing information ... 

Keywords: bioinformatics, filtering, heterogeneous data, information representation, 
information retrieval, interactive text mining, software architecture, user-guided 
information extraction 

Volume 29 Issue 3

Full text available: pdf(1.06 MB) Additional Information: full citation, appendices and supplements,

We propose a novel approach to the view-update problem for tree-structured data: a
domain-specific programming language in which all expressions denote bidirectional 
transformations on trees. In one direction, these transformations— dubbed lenses— map a 
concrete tree into a simplified abstract view; in the other, they map a modified abstract 
view, together with the original concrete tree, to a correspondingly modified concrete tree. 
Our design emphasizes both robustness and ea ... 

Keywords: Bidirectional programming, Harmony, XML, lenses, view update problem 



27 TotalRecall: a bilingual concordance for computer assisted translation and language 
learning 

Jian-Cheng Wu, Kevin C. Yeh, Thomas C. Chuang, Wen-Chi Shei, Jason S. Chang 
July 2003 Proceedings of the 41st Annual Meeting on Association for Computational 
Linguistics - Volume 2 ACL '03 

Publisher: Association for Computational Linguistics 

Full text available: pdf(94.91 KB) Additional Information: full citation, abstract, references

This paper describes a Web-based English-Chinese concordance system, Total-Recall, 
developed to promote translation reuse and encourage authentic and idiomatic use in 
second language writing. We exploited and structured existing high-quality translations 
from the bilingual Sinorama Magazine to build the concordance of authentic text and 
translation. Novel approaches were taken to provide high-precision bilingual alignment on 
the sentence, phrase and word levels. A browser-based user interface (U ... 

28 Developing regions 2: WebKhoj: Indian language IR from multiple character
encodings 

Prasad Pingali, Jagadeesh Jagarlamudi, Vasudeva Varma 

May 2006 Proceedings of the 15th international conference on World Wide Web 
WWW '06 

Publisher: ACM Press 

Full text available: pdf(480.79 KB) Additional Information: full citation, abstract, references, index terms

Today web search engines provide the easiest way to reach information on the web. In 
this scenario, more than 95% of Indian language content on the web is not searchable due 
to multiple encodings of web pages. Most of these encodings are proprietary and hence 
need some kind of standardization for making the content accessible via a search engine. 
In this paper we present a search engine called WebKhoj which is capable of searching 
multi-script and multi-encoded Indian language content on the web. ... 

Keywords: Indian languages, non-standard encodings, web search 



29 Plagiarism detection across programming languages
Christian Arwin, S. M. M. Tahaghoghi 

January 2006 Proceedings of the 29th Australasian Computer Science Conference - 
Volume 48 ACSC '06 

Publisher: Australian Computer Society, Inc. 

Full text available: pdf(193.09 KB) Additional Information: full citation, abstract, references, index terms

Plagiarism is a widespread problem in assessment tasks; in computing courses, students 
often plagiarise source code. For all but the smallest classes, manual detection of such 
plagiarism is impractical, and, while automated tools are available, none has been applied 
to detect inter-lingual plagiarism, where source code is copied from one language to 
another. In this work, we propose a novel approach, XPlag, to detect plagiarism involving 
multiple languages using intermediate program code produce ... 

30 Community tech: Wikifying your interface: facilitating community-based interface
translation 

M. Cameron Jones, Dinesh Rathi, Michael B. Twidale 

June 2006 Proceedings of the 6th conference on Designing Interactive systems DIS
'06

Publisher: ACM 

Full text available: pdf(1.07 MB) Additional Information: full citation, abstract, references, index terms

We explore the application of a wiki-based technology and style of interaction to enabling 
the incremental translation of a collaborative application into a number of different 
languages, including variant English language interfaces better suited to the needs of 
particular user communities. The development work allows us to explore in more detail the 
design space of functionality and interfaces relating to tailoring, customization, 
personalization and localization, and the challenges of design! ... 

Keywords: community-based technology, customization, internationalization, mash-ups, 
personalization, wikis 



31 Propositional Satisfiability and Constraint Programming: A comparative survey 
Lucas Bordeaux, Youssef Hamadi, Lintao Zhang 
December 2006 ACM Computing Surveys (CSUR), volume 38 issue 4 

Publisher: ACM Press 

Full text available: pdf(878.93 KB) Additional Information: full citation, abstract, references, index terms

Propositional Satisfiability (SAT) and Constraint Programming (CP) have developed as two 
relatively independent threads of research cross-fertilizing occasionally. These two 
approaches to problem solving have a lot in common as evidenced by similar ideas 
underlying the branch and prune algorithms that are most successful at solving both kinds 
of problems. They also exhibit differences in the way they are used to state and solve 
problems since SAT's approach is, in general, a black-box approach, ... 

Keywords: SAT, Search, constraint satisfaction 





32 Session 1: Leveraging semantic technologies for enterprise search

Gianluca Demartini 

November 2007 Proceedings of the ACM first Ph.D. workshop in CIKM PIKM '07 
Publisher: ACM 

Full text available: pdf(184.47 KB) Additional Information: full citation, abstract, references, index terms

Enterprise search is very different from Web search (for example in the link structure or in 
the user's needs and goal) and some steps have been already done to exploit these
differences in order to improve the effectiveness of enterprise search. In this paper we 
present the state of the art of the enterprise search field with some open issues. We also 
present a research plan that aims at using Information Retrieval, Semantic Web, and User 
Modelling techniques to cope with these issues improvi ... 

Keywords: enterprise search, evaluation, expert search, personalization 

September 2006 Computational Linguistics, volume 32 issue 3
Publisher: MIT Press 

Full text available: pdf(987.55 KB) Additional Information: full citation, abstract, references, index terms

Since the Web by far represents the largest public repository of natural language texts, 
recent experiments, methods, and tools in the area of corpus linguistics often use the Web 
as a corpus. For applications where high accuracy is crucial, the problem has to be faced 
that a non-negligible number of orthographic and grammatical errors occur in Web 
documents. In this article we investigate the distribution of orthographic errors of various 
types in Web pages. As a by-product, methods are develop ... 

34 Interoperability for digital libraries worldwide

Andreas Paepcke, Chen-Chuan K. Chang, Terry Winograd, Hector Garcia-Molina
April 1998 Communications of the ACM, volume 41 issue 4
Publisher: ACM Press 

Full text available: pdf(299.48 KB) Additional Information: full citation, references, citings, index terms



35 Virtualization and operating systems: Libra: a library operating system for a jvm in a 
virtualized execution environment

Glenn Ammons, Jonathan Appavoo, Maria Butrico, Dilma Da Silva, David Grove, Kiyokuni 
Kawachiya, Orran Krieger, Bryan Rosenburg, Eric Van Hensbergen, Robert W. Wisniewski 
June 2007 Proceedings of the 3rd international conference on Virtual execution 

environments VEE '07 
Publisher: ACM Press 

Full text available: pdf(223.69 KB) Additional Information: full citation, abstract, references, index terms

If the operating system could be specialized for every application, many applications would 
run faster. For example, Java virtual machines (JVMs) provide their own threading model 
and memory protection, so general-purpose operating system implementations of these 
abstractions are redundant. However, traditional means of transforming existing systems 
into specialized systems are difficult to adopt because they require replacing the entire 
operating system. This paper describes Libra, an execut ... 

Keywords: JVM, exokernels, virtualization, xen 



36 Using keyphrases as search result surrogates on small screen devices
Steve Jones, Matt Jones, Shaleen Deo

February 2004 Personal and Ubiquitous Computing, volume 8 issue 1
Publisher: Springer-Verlag 

Full text available: pdf(675.87 KB) Additional Information: full citation, abstract, citings, index terms

This paper investigates user interpretation of search result displays on small screen 
devices. Such devices present interesting design challenges given their limited display 
capabilities, particularly in relation to screen size. Our aim is to provide users with succinct 
yet useful representations of search results that allow rapid and accurate decisions to be 
made about the utility of result documents, yet minimize user actions (such as scrolling), 
the use of device resources, and the volume of ... 

Keywords: Keyphrase extraction, Searching, Small screen devices, Usability evaluation 

R. J. Bayardo, W. Bohrer, R. Brice, A. Cichocki, J. Fowler, A. Helal, V. Kashyap, T. Ksiezyk, G.
Martin, M. Nodine, M. Rashid, M. Rusinkiewicz, R. Shea, C. Unnikrishnan, A. Unruh, D. Woelk 
June 1997 ACM SIGMOD Record , Proceedings of the 1997 ACM SIGMOD international 

conference on Management of data SIGMOD '97, volume 26 issue 2 
Publisher: ACM Press 

Full text available: pdf(1.69 MB) Additional Information: full citation, abstract, references, citings, index terms

The goal of the InfoSleuth project at MCC is to exploit and synthesize new technologies 
into a unified system that retrieves and processes information in an ever-changing 
network of information sources. InfoSleuth has its roots in the Carnot project at MCC,
which specialized in integrating heterogeneous information bases. However, recent 
emerging technologies such as internetworking and the World Wide Web have significantly 
expanded the types, availability, and volume of data available to a ... 

38 MPEG-4: an object-based multimedia coding standard supporting mobile applications 
Atul Puri, Alexandros Eleftheriadis

June 1998 Mobile Networks and Applications, volume 3 issue 1
Publisher: Kluwer Academic Publishers 

Full text available: pdf(747.80 KB) Additional Information: full citation, abstract, references, citings, index terms, review

The ISO MPEG committee, after successful completion of the MPEG-1 and the MPEG-2 
standards is currently working on MPEG-4, the third MPEG standard. Originally, MPEG-4 
was conceived to be a standard for coding of limited complexity audio-visual scenes at 
very low bit-rates; however, in July 1994, its scope was expanded to include coding of 
scenes as a collection of individual audio-visual objects and enabling a range of advanced 
functionalities not supported by other standards. One of the ke ... 

39 Systems: A retrospective look at Greenstone: lessons from the first decade
Ian H. Witten, David Bainbridge 

June 2007 Proceedings of the 2007 conference on Digital libraries JCDL '07 
Publisher: ACM Press 

Full text available: pdf(572.61 KB) Additional Information: full citation, abstract, references, index terms

The Greenstone Digital Library Software has helped spread the practical impact of digital 
library technology throughout the world, with particular emphasis on developing countries. 
As Greenstone enters its second decade, this article takes a retrospective look at its 
development, the challenges that have been faced, and the lessons that have been 
learned in developing and deploying a comprehensive open-source system for the 
construction of digital libraries internationally. Not surprisingly, ... 

Keywords: architecture, greenstone, internationalization 



40 BrowserShield: Vulnerability-driven filtering of dynamic HTML 

Charles Reis, John Dunagan, Helen J. Wang, Opher Dubrovsky, Saher Esmeir 
September 2007 ACM Transactions on the Web (TWEB), volume 1 issue 3

Publisher: ACM Press 

Full text available: pdf(385.02 KB) Additional Information: full citation, abstract, references, index terms

Vulnerability-driven filtering of network data can offer a fast and easy-to-deploy 
alternative or intermediary to software patching, as exemplified in Shield [Wang et al. 
2004]. In this article, we take Shield's vision to a new domain, inspecting and cleansing 
not just static content, but also dynamic content. The dynamic content we target is the 
dynamic HTML in Web pages, which have become a popular vector for attacks. The key 
challenge in filtering dynamic HTML is that it is undecidable to ... 

41 Sync your data: update propagation for heterogeneous protein databases
T. Claypool, A. Rundensteiner 

September 2005 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 14 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(2.92 MB) Additional Information: full citation, abstract

The traditional model of bench (wet) chemistry in many life sciences domain is today 
actively complimented by computer-based discoveries utilizing the growing number of 
online data sources. A typical computer-based discovery scenario for many life scientists 
includes the creation of local caches of pertinent information from multiple online 
resources such as Swissprot [Nucleic Acid Res. 1(28), 45-48 (2000)], PIR [Nucleic Acids 
Res. 28(1), 41-44 (2000)], PDB [Th ... 

Keywords: Data transformation, Data translation, Schema evolution, Update propagation, 
View maintenance 



42 Digital libraries and cyberinfrastructure track: creating information representations for
the humanities (part 2): E-library of medieval chant manuscript transcriptions 
Louis W. G. Barton, John A. Caldwell, Peter G. Jeavons 

June 2005 Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries 
JCDL '05 

Publisher: ACM Press 

Full text available: pdf(668.36 KB) Additional Information: full citation , abstract , references , index terms 

In this paper we present our rationale and design principles for a distributed e-library of 
medieval chant manuscript transcriptions. We describe the great variety in neumatic 
notations, in order to motivate a standardised data representation that is lossless and 
universal with respect to these musical artefacts. We present some details of the data 
representation and an XML Schema for describing and delivering transcriptions via the 
Web. We argue against proposed data format ... 

Keywords: XML, chant, comparison, data representation, digital libraries, medieval 
manuscripts, musical notation, search, transcription 

Publisher: ACM Press

Full text available: pdf(680.78 KB)

In this paper we describe the Mobile Link (m-Links) infrastructure for utilizing existing 
World Wide Web content and services on wireless phones and other very small Internet 
terminals. Very small devices, typically with 3-20 lines of text, provide portability and 
other functionality while sacrificing usability as Internet terminals. In order to provide 
access on such limited hardware we propose a small device web navigation model that is 
more appropriate than the desktop computer's web brows ... 

Keywords: middleware, proxy, web phones, wireless, wireless web 

Publisher: ACM Press

Virtually all proposals for querying XML include a class of query we term "containment
queries". It is also clear that in the foreseeable future, a substantial amount of XML data 
will be stored in relational database systems. This raises the question of how to support 
these containment queries. The inverted list technology that underlies much of 
Information Retrieval is well-suited to these queries, but should we implement this 
technology (a) in a separate loosely-coupled IR engin ... 

45 Knowledge and representation: Acquisition, representation, query and analysis of
spatial data: a demonstration 3D digital library

Jeremy Rowe, Anshuman Razdan, Arleyn Simon 

May 2003 Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries 
JCDL '03

Publisher: IEEE Computer Society 

Full text available: pdf(7.27 MB) Additional Information: full citation, abstract, references, index terms

The increasing power of techniques to model complex geometry and extract meaning from 
3D information create complex data that must be described, stored, and displayed to be 
useful to researchers. Responding to the limitations of two-dimensional (2D) data 
representations perceived by discipline scientists, the Partnership for Research in Spatial 
Modeling (PRISM) project at Arizona State University (ASU) developed modeling and 
analytic tools that raise the level of abstraction and add semantic val ... 

Keywords: WWW Applications, digital library, geometric modeling, image databases, 
information visualization, physically based modeling, scientific visualization, shape 
recognition 

Daniel Brandon

December 2001 Journal of Computing Sciences in Colleges, Volume 17 issue 2 

47 Innovative Document Systems: The multivalent browser: a platform for new ideas
Thomas A. Phelps, Robert Wilensky 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 
DocEng '01 

Publisher: ACM Press 

Full text available: pdf(188.51 KB) Additional Information: full citation, abstract, references, citings, index terms

The Multivalent Browser is built on an architecture that separates functionality from
concrete document format. Almost all functionality is made available via relatively small 
modules of code called behaviors that programmers can write to extend the core system. 
Behaviors can be as significant and powerful as parser-renderers for scanned paper, HTML, 
or TeX DVI; as fine-grained as hyperlinks, cookies, and the disabling of menu items; and 
as innovative or uncommon as in situ annotations, "lenses", ...

Keywords: annotation, architecture, digital, document, multivalent behavior, paper, 
scanned 



48 Application of machine learning techniques to improve web search results
Jessie Burger, Aaron Archer Waterman 

May 2003 Journal of Computing Sciences in Colleges, volume 18 issue 5 

Publisher: Consortium for Computing Sciences in Colleges 

Full text available: pdf(18.26 KB) Additional Information: full citation, index terms



49 Early user-system interaction for database selection in massive domain-specific
online environments
Jack G. Conrad, Joanne R. S. Claussen

January 2003 ACM Transactions on Information Systems (TOIS), volume 21 issue 1 

Publisher: ACM Press 

Full text available: pdf(845.54 KB) Additional Information: full citation, abstract, references, index terms

The continued growth of very large data environments such as Westlaw and Dialog, in 
addition to the World Wide Web, increases the importance of effective and efficient 
database selection and searching. Current research focuses largely on completely 
autonomous and automatic selection, searching, and results merging in distributed 
environments. This fully automatic approach has significant deficiencies, including reliance 
upon thresholds below which databases with relevant documents are not search ... 

Keywords: Database selection, metadata for retrieval, structuring information to aid 
search and navigation, user interaction 

50 Strategic directions in electronic commerce and digital libraries: towards a digital
agora

Nabil Adam, Yelena Yesha

December 1996 ACM Computing Surveys (CSUR), volume 28 issue 4 

Publisher: ACM Press 

Full text available: pdf(244.34 KB) Additional Information: full citation, references, citings, index terms

51 Evaluating the technologies: The text retrieval conferences (TRECs)
Ellen M. Voorhees, Donna Harman 

October 1998 Proceedings of a workshop on held at Baltimore, Maryland: October 13- 
15, 1998 

Publisher: Association for Computational Linguistics 

Full text available: pdf(2.76 MB) Additional Information: full citation, abstract, references, citings

Phase III of the TIPSTER project included three workshops for evaluating document 
detection (information retrieval) projects: the fifth, sixth and seventh Text REtrieval 
Conferences (TRECs). This work was co-sponsored by the National Institute of Standards 
and Technology (NIST), and included evaluation not only of the TIPSTER contractors, but 
also of many information retrieval groups outside of the TIPSTER project. The conferences 
were run as workshops that provided a forum for participating gro ... 

52 Self
David Ungar, Randall B. Smith 

June 2007 Proceedings of the third ACM SIGPLAN conference on History of
programming languages HOPL III 

Publisher: ACM Press 

Full text available: pdf(1.70 MB) Additional Information: full citation, appendices and supplements,
abstract, references, index terms

The years 1985 through 1995 saw the birth and development of the language Self, 
starting from its design by the authors at Xerox PARC, through first implementations by 
Ungar and his graduate students at Stanford University, and then with a larger team 
formed when the authors joined Sun Microsystems Laboratories in 1991. Self was 
designed to help programmers become more productive and creative by giving them a 
simple, pure, and powerful language, an implementation that combined ease of use wit ... 

Keywords: Self, adaptive optimization, cartoon animation, dynamic language, dynamic 
optimization, exploratory programming, history of programming languages, morphic, 
object-oriented language, programming environment, prototype-based programming 
language, virtual machine 

53 Summarization and question answering: Using librarian techniques in automatic text
summarization for information retrieval

Min-Yen Kan, Judith L. Klavans

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries 

JCDL '02 
Publisher: ACM Press 

Full text available: pdf(1.15 MB) Additional Information: full citation, abstract, references, citings, index terms

A current application of automatic text summarization is to provide an overview of relevant 
documents coming from an information retrieval (IR) system. This paper examines how 
Centrifuser, one such summarization system, was designed with respect to methods used 
in the library community. We have reviewed these librarian expert techniques to assist 
information seekers and codified them into eight distinct strategies. We detail how we 
have operationalized six of these strategies in Centrifuser by c ... 

Keywords: automatic text summarization, information retrieval user interfaces, reference 
librarian techniques 

November 2006 Proceedings of the 1st international workshop on Contextualized

attention metadata: collecting, managing and exploiting of rich usage 
information CAMA '06 

Publisher: ACM Press 

Full text available: pdf(564.39 KB) Additional Information: full citation, abstract, references, index terms

To efficiently support personal ways of desktop usage, we have to unleash the power of 
implicit metadata thus giving local data a well defined meaning. To achieve this, 
contextual information across heterogeneous media types, file formats, and applications 
should be annotated and linked. In this paper we present a light weight system which 
monitors the file structure and automatically generates semantic metadata based on the 
user activities. We underpin the utility of extracted metadata by showi ... 

Keywords: contextualized metadata 



55 Approximative filtering of XML documents in a publish/subscribe system
Annika Hinze, Yann Michel, Torsten Schlieder 

January 2006 Proceedings of the 29th Australasian Computer Science Conference - 
Volume 48 ACSC '06 

Publisher: Australian Computer Society, Inc. 

Full text available: pdf(228.42 KB) Additional Information: full citation, abstract, references, index terms

Publish/subscribe systems filter published documents and inform their subscribers about 
documents matching their interests. Recent systems have focussed on documents or 
messages sent in XML format. Subscribers have to be familiar with the underlying XML 
format to create meaningful subscriptions. A service might support several providers with 
slightly differing formats, e.g., several publishers of books. This makes the definition of a 
successful subscription almost impossible. This paper proposes ... 

56 E1HA?!?: deploying Web and WAP services using XML technology
Chiara Biancheri, Jean-Christophe Pazzaglia, Gavino Paddeu 
March 2001 ACM SIGMOD Record, volume 30 issue 1
Publisher: ACM Press 

Full text available: pdf(744.53 KB) Additional Information: full citation, abstract, index terms

The exponential growth of resources on the web, and the wide deployment of devices for 
multimodal access to the Internet, lead to new problems in information management. In 
this context, and as part of the European project Vision, we have built an interactive 
telematic handbook of the culture and the territory of Sardinia. A team of cultural experts 
browsed the web to get a large collection of Internet resources. The system built for the
management of this data uses emerging Internet technologies ... 

Keywords: DBMS, DTD, WAP, WML, XML, XSL, metadata, search engine 



57 Commercial applications of natural language processing
Kenneth W. Church, Lisa F. Rau 

November 1995 Communications of the ACM, volume 38 issue 11
Publisher: ACM Press 

Full text available: pdf(314.22 KB) Additional Information: full citation, abstract, references, citings, index terms

Vast quantities of text are becoming available in electronic form, ranging from published 
documents (e.g., electronic dictionaries, encyclopedias, libraries and archives for 
information retrieval services), to private databases (e.g., marketing information, legal 
records, medical histories), to personal email and faxes. Online information services are 
reaching mainstream computer users. There were over 15 million Internet users in 1993,
and projections are for 30 million in 1997. With media ... 

58 Information integration with attribution support for corporate profiles
Thomas Lee, Melanie Chams, Robert Nado, Michael Siegel, Stuart Madnick 
November 1999 Proceedings of the eighth international conference on Information 

and knowledge management CIKM '99 
Publisher: ACM Press 

Full text available: pdf(845.25 KB) Additional Information: full citation, abstract, references, index terms

The proliferation of electronically available data within large organizations as well as 
publicly available data (e.g. over the World Wide Web) poses challenges for users who 
wish to efficiently interact with and integrate multiple heterogeneous sources. This paper 
presents CI3, a corporate information integrator, which applies XML as a tool to facilitate 
data mediation and integration amongst heterogeneous sources in the context of financial 
analysts creating corporate ... 

Keywords: XML, attribution, data integration, data mediation, metadata 

59 Agents, interactions, mobility and systems: Guarding security sensitive content using
confined mobile agents

Guido van 't Noordende, Frances M. T. Brazier, Andrew S. Tanenbaum

March 2007 Proceedings of the 2007 ACM symposium on Applied computing SAC '07 
Publisher: ACM Press 

Full text available: pdf(152.27 KB) Additional Information: full citation, abstract, references, index terms

Mobile code and mobile agents are generally associated with security vulnerabilities, rather 
than with increased security. This paper describes an approach in which mobile agents are 
confined, in order to allow content providers to retain control over how their data is 
exported while allowing agents to search the full content of this data locally. This approach 
offers increased control and security compared to the traditional client-server technologies 
commonly used for building distri ... 

Keywords: confinement, information flow control, mobile agents 

60 Building and Using a Lexical Knowledge Base of Near-Synonym Differences
Diana Inkpen, Graeme Hirst 

June 2006 Computational Linguistics, volume 32 issue 2 
Publisher: MIT Press 

Full text available: pdf(3.60 MB) Additional Information: full citation, abstract, references, cited by, index terms

The initial knowledge base is later enriched with information from other machine-readable 
dictionaries. Information about the collocational behavior of the near-synonyms is acquired 
from free text. The knowledge base is used by Xenon, a natural language generation 
system that shows how the new lexical resource can be used to choose the best near- 
synonym in specific situations. 
61 Detection and evidence: Out-of-context noun phrase semantic interpretation with
cross-linguistic evidence
Roxana Girju

November 2006 Proceedings of the 15th ACM international conference on Information 
and knowledge management CIKM '06 

Publisher: ACM Press 

Full text available: pdf(231.70 KB) Additional Information: full citation, abstract, references, index terms

The acquisition of semantic knowledge is paramount for any application that requires a 
deep understanding of natural language text. Motivated by the problem of building a noun 
phrase-level semantic parser and adapting it to various applications, such as machine 
translation and multilingual question answering, in this paper we present a domain- 
independent model for noun phrase semantic interpretation. We investigate the problem 
based on cross-linguistic evidence from a set of four Romance languag ... 

Keywords: SVM, classification, computational semantics, semantic relations 



62 Building a distributed full-text index for the web
Sergey Melnik, Sriram Raghavan, Beverly Yang, Hector Garcia-Molina
July 2001 ACM Transactions on Information Systems (TOIS), volume 19 issue 3 
Publisher: ACM Press 

Full text available: pdf(651.72 KB) Additional Information: full citation, abstract, references, citings, index terms, review



We identify crucial design issues in building a distributed inverted index for a large 
collection of Web pages. We introduce a novel pipelining technique for structuring the core 
index-building system that substantially reduces the index construction time. We also 
propose a storage scheme for creating and managing inverted files using an embedded 
database system. We suggest and compare different strategies for collecting global 
statistics from distributed inverted indexes. Finally, we present pe ... 

Keywords: Distributed indexing, Embedded databases, Inverted files, Pipelining, Text 
retrieval 
Charles Reis, John Dunagan, Helen J. Wang, Opher Dubrovsky, Saher Esmeir
November 2006 Proceedings of the 7th symposium on Operating systems design and 
implementation OSDI '06 

Publisher: USENIX Association 

Full text available: pdf(411.90 KB) Additional Information: full citation, abstract, references

Vulnerability-driven filtering of network data can offer a fast and easy-to-deploy 
alternative or intermediary to software patching, as exemplified in Shield [43]. In this
paper, we take Shield's vision to a new domain, inspecting and cleansing not just static 
content, but also dynamic content. The dynamic content we target is the dynamic HTML in 
web pages, which have become a popular vector for attacks. The key challenge in filtering 
dynamic HTML is that it is undecidable to statically deter ... 

64 Using SGML as a basis for data-intensive NLP
David McKelvie, Chris Brew, Henry Thompson 

March 1997 Proceedings of the fifth conference on Applied natural language 
processing 

Publisher: Morgan Kaufmann Publishers Inc. 

Full text available: pdf(792.46 KB), Publisher Site
Additional Information: full citation, abstract, references, citings

This paper describes the LT NSL system (McKelvie et al, 1996), an architecture for writing 
corpus processing tools. This system is then compared with two other systems which 
address similar issues, the GATE system (Cunningham et al, 1995) and the IMS Corpus 
Workbench (Christ, 1994). In particular we address the advantages and disadvantages of 
an SGML approach compared with a non-SGML database approach. 

65 Merging interactive visualizations with hypertextbooks and course management
Guido Rößling, Thomas Naps, Mark S. Hall, Ville Karavirta, Andreas Kerren, Charles Leska,
Andres Moreno, Rainer Oechsle, Susan H. Rodger, Jaime Urquiza-Fuentes, J. Angel
Velazquez-Iturbide

June 2006 ACM SIGCSE Bulletin, Working group reports on ITiCSE on Innovation and
technology in computer science education ITiCSE-WGR '06, volume 38 issue 4
Publisher: ACM Press 

Full text available: pdf(574.51 KB) Additional Information: full citation, abstract, references, index terms

As a report of a working group at ITiCSE 2006, this paper provides a vision of how 
visualizations and the software that generates them may be integrated into 
hypertextbooks and course management systems. This integration generates a unique 
synergy that we call a Visualization-based Computer Science Hypertextbook (VizCoSH). By 
borrowing features of both traditional hypertextbooks and course management systems, 
VizCoSHs become delivery platforms that address some of the reasons why 
visualizations ... 

Keywords: animation, hypertextbooks, pedagogy, visualization 



66 Storage: DejaView: a personal virtual computer recorder

Oren Laadan, Ricardo A. Baratto, Dan B. Phung, Shaya Potter, Jason Nieh 
October 2007 Proceedings of twenty-first ACM SIGOPS symposium on Operating
systems principles SOSP '07
Publisher: ACM Press 

Full text available: pdf(534.51 KB) Additional Information: full citation, abstract, references, index terms

As users interact with the world and their peers through their computers, it is becoming 
important to archive and later search the information that they have viewed. We present 
DejaView, a personal virtual computer recorder that provides a complete record of a 
desktop computing experience that a user can playback, browse, search, and revive
seamlessly. DejaView records visual output, checkpoints corresponding application and file 
system state, and captures displayed text with contextua ... 

Keywords: desktop search, virtualization 



67 Linking by inking: trailblazing in a paper-like hypertext
Morgan N. Price, Gene Golovchinsky, Bill N. Schilit 

May 1998 Proceedings of the ninth ACM conference on Hypertext and hypermedia : 
links, objects, time and space — structure in hypermedia systems: links, 
objects, time and space — structure in hypermedia systems HYPERTEXT 
'98 

Publisher: ACM Press 

Full text available: pdf(1.46 MB) Additional Information: full citation, references, citings, index terms




68 Desperately seeking Cebuano
Douglas W. Oard, David Doermann, Bonnie Dorr, Daqing He, Philip Resnik, Amy Weinberg, 
William Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn, Kevin 
Knight 

May 2003 Proceedings of the 2003 Conference of the North American Chapter of the 
Association for Computational Linguistics on Human Language 
Technology: companion volume of the Proceedings of HLT-NAACL 2003— 
short papers - Volume 2 NAACL '03 

Publisher: Association for Computational Linguistics 

Full text available: pdf(38.04 KB) Additional Information: full citation, abstract

This paper describes an effort to rapidly develop language resources and component 
technology to support searching Cebuano news stories using English queries. Results from 
the first 60 hours of the exercise are presented. 

69 Expressive retrieval from XML documents
Taurai Tapiwa Chinenyanga, Nicholas Kushmerick

September 2001 Proceedings of the 24th annual international ACM SIGIR conference
on Research and development in information retrieval SIGIR '01 

Publisher: ACM Press 

Full text available: pdf(400.63 KB) Additional Information: full citation, abstract, references, citings, index terms

The emergence of XML as a standard interchange format for structured documents/data 
has given rise to many XML query language proposals. However, some of these languages 
do not support information retrieval-style ranked queries based on textual similarity. There 
have been several extensions to these query languages to support keyword search, but 
the resulting query languages cannot express queries such as "find books and CDs with
similar titles". Either these extensions u ... 

70 Integrating document and data retrieval based on XML
Jan-Marco Bremer, Michael Gertz 

January 2006 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 15 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(841.10 KB) Additional Information: full citation, abstract

For querying structured and semistructured data, data retrieval and document retrieval 
are two valuable and complementary techniques that have not yet been fully integrated. 
In this paper, we introduce integrated information retrieval (IIR), an XML-based retrieval
approach that closes this gap. We introduce the syntax and semantics of an extension of 
the XQuery language called XQuery/IR. The extended language realizes IIR and thereby 
allows users to formulate new kinds of queries by nesting rank ... 

Keywords: Data retrieval, Document retrieval, Index structures, Integrated information 
retrievals, Structural join, XML 



71 Graph based retrieval (IR): Dex: high-performance exploration on large graphs for
information retrieval 

Norbert Martínez-Bazan, Victor Muntes-Mulero, Sergio Gomez-Villamor, Jordi Nin, Mario-A.
Sanchez-Martinez, Josep-L. Larriba-Pey 

November 2007 Proceedings of the sixteenth ACM conference on Conference on 
information and knowledge management CIKM '07 

Publisher: ACM 

Full text available: pdf(500.02 KB) Additional Information: full citation, abstract, references, index terms

Link and graph analysis tools are important devices to boost the richness of information 
retrieval systems. Internet and the existing social networking portals are just a couple of 
situations where the use of these tools would be beneficial and enriching for the users and 
the analysts. However, the need for integrating different data sources and, even more 
important, the need for high performance generic tools, is at odds with the continuously 
growing size and number of data repositories. 



Keywords: data representation, graph databases, information retrieval, query 
performance, social networks 



72 Research articles and surveys: Peer-to-peer management of XML data: issues and
research challenges
Georgia Koloniari, Evaggelia Pitoura

June 2005 ACM SIGMOD Record, Volume 34 Issue 2 

Publisher: ACM Press 

Full text available: pdf(301.94 KB) Additional Information: full citation, abstract, references, index terms

Peer-to-peer (p2p) systems are attracting increasing attention as an efficient means of 
sharing data among large, diverse and dynamic sets of users. The widespread use of XML 
as a standard for representing and exchanging data in the Internet suggests using XML for 
describing data shared in a p2p system. However, sharing XML data imposes new 
challenges in p2p systems related to supporting advanced querying beyond simple 
keyword-based retrieval. In this paper, we focus on data management issues fo ... 

73 A statistical model for near-synonym choice
Diana Inkpen 

January 2007 ACM Transactions on Speech and Language Processing (TSLP), volume 4
issue 1
Publisher: ACM Press 

Full text available: pdf(312.05 KB) Additional Information: full citation, abstract, references, index terms

We present an unsupervised statistical method for automatic choice of near-synonyms 
when the context is given. The method uses the Web as a corpus to compute scores based 
on mutual information. Our evaluation experiments show that this method performs better 
than two previous methods on the same task. We also describe experiments in using 
supervised learning for this task. We present an application to an intelligent thesaurus. 
This work is also useful in machine translation and natural language ...

Keywords: Lexical choice, Web as a corpus, intelligent thesaurus, near-synonyms, 
semantic similarity 



74 Tutorials: tutorial 1: Towards an enterprise XML architecture
Ravi Murthy, Zhen Hua Liu, Muralidhar Krishnaprasad, Sivasankaran Chandrasekar, Anh- 
Tuan Tran, Eric Sedlar, Daniela Florescu, Susan Kotsovolos, Nipun Agarwal, Vikas Arora, 
Viswanathan Krishnamurthy 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 
Management of data SIGMOD '05

Publisher: ACM Press 

Full text available: pdf(260.60 KB) Additional Information: full citation, abstract, references, citings

XML is being increasingly used in diverse domains ranging from data and application 
integration to content management. Oracle provides an enterprise wide platform for 
managing all types of XML content. Within the Oracle database and the application server, 
the XML content can be efficiently stored using a variety of storage and indexing methods 
and it can be processed using multiple standard languages within different programmatic 
environments. 

75 The interagency digital library for science and engineering: a federated digital library
pilot for the U.S. government scientist
Blaine Baker, John Salerno

August 1999 Proceedings of the fourth ACM conference on Digital libraries DL '99 

Publisher: ACM Press 

Full text available: pdf(25.36 KB) Additional Information: full citation, references, index terms



Keywords: government scientific researchers, government security levels, heterogeneous 
database searching, interoperable digital libraries, universal log-on 

76 Communities: Blogging as social activity, or, would you let 900 million people read
your diary?
Bonnie A. Nardi, Diane J. Schiano, Michelle Gumbrecht

November 2004 Proceedings of the 2004 ACM conference on Computer supported
cooperative work CSCW '04
Publisher: ACM Press 

Full text available: pdf(376.63 KB) Additional Information: full citation, abstract, references, citings, index terms

"Blogging" is a Web-based form of communication that is rapidly becoming mainstream. In 
this paper, we report the results of an ethnographic study of blogging, focusing on blogs 
written by individuals or small groups, with limited audiences. We discuss motivations for 
blogging, the quality of social interactivity that characterized the blogs we studied, and 
relationships to the blogger's audience. We consider the way bloggers related to the 
known audience of their personal social networks as ... 

Keywords: WWW, activity theory, blogs, computer-mediated communication 
78 EXACT: the experimental algorithmics computational toolkit
William E. Hart, Jonathan W. Berry, Robert Heaphy, Cynthia A. Phillips
June 2007 Proceedings of the 2007 workshop on Experimental computer science
ExpCS '07 
Publisher: ACM Press 

Full text available: pdf(231.70 KB) Additional Information: full citation, abstract, references, index terms

In this paper, we introduce EXACT, the Experimental Algorithmics Computational Toolkit. 
EXACT is a software framework for describing, controlling, and analyzing computer 
experiments. It provides the experimentalist with convenient software tools to ease and 
organize the entire experimental process, including the description of factors and levels, 
the design of experiments, the control of experimental runs, the archiving of results, and 
analysis of results. 

As a case study for EXACT, we ... 

Keywords: experimental analysis, experimental design, software testing 



79 XML query processing: Query processing of streamed XML data
Leonidas Fegaras, David Levine, Sujoe Bose, Vamsi Chaluvadi
November 2002 Proceedings of the eleventh international conference on Information
and knowledge management CIKM '02 

Publisher: ACM Press 
Full text available: pdf(246.55 KB)

We are addressing the efficient processing of continuous XML streams, in which the server 
broadcasts XML data to multiple clients concurrently through a multicast data stream, 
while each client is fully responsible for processing the stream. In our framework, a server 
may disseminate XML fragments from multiple documents in the same stream, can repeat 
or replace fragments, and can introduce new fragments or delete invalid ones. A client
uses a light-weight database based on our proposed XML alge ... 

Keywords: XML, databases, query optimization, query processing 
81 An XML framework for agent-based E-commerce
Robert J. Glushko, Jay M. Tenenbaum, Bart Meltzer 
March 1999 Communications of the ACM, volume 42 issue 3 

Publisher: ACM Press 
82 Strategic directions in database systems — breaking out of the box
Avi Silberschatz, Stan Zdonik 

December 1996 ACM Computing Surveys (CSUR), volume 28 issue 4 
Publisher: ACM Press 

Full text available: pdf(222.64 KB) Additional Information: full citation, references, citings, index terms



83 Information exploration using The Pond

Olov Stahl, Anders Wallberg, Jonas Soderberg, Jan Humble, Lennart E. Fahlen, Adrian 
Bullock, Jenny Lundberg 

September 2002 Proceedings of the 4th international conference on Collaborative 
virtual environments CVE '02 

Publisher: ACM Press 

Full text available: pdf(2.38 MB) Additional Information: full citation, abstract, references, citings, index terms



In this paper we describe The Pond, a system used to search for and visualise data 
elements on an engaging tabletop display. The Pond uses methods of unencumbered 
interaction and audio feedback to allow users to investigate data elements, and supports 
shoulder-to-shoulder collaboration with the physical Pond artefact mediating the 
collaboration between those people gathered around it. The user interface is based on an 
ecosystem metaphor, presenting data elements in the form of shoals of aquatic ... 

Keywords: database, searching, virtual environment, visualization 
Jonathan Trevor, David M. Hilbert, Bill N. Schilit, Tzu Khiau Koh

November 2001 Proceedings of the 14th annual ACM symposium on User interface 
software and technology UIST '01 

Publisher: ACM Press 

Full text available: pdf(1.34 MB) Additional Information: full citation, abstract, references, citings, index terms

While it is generally accepted that new Internet terminals should leverage the installed 
base of Web content and services, the differences between desktop computers and very 
small devices makes this challenging. Indeed, the browser interaction model has evolved 
on desktop computers having a unique combination of user interface (large display, 
keyboard, pointing device), hardware, and networking capabilities. In contrast, Internet 
enabled cell phones, typically with 3-10 lines of text, sacrifice ... 

Keywords: PDA, Web browsing, transcoding, transducing, web phone, wireless web 
December 2006 Linux Journal, volume 2006 issue 152
Publisher: Specialized Systems Consultants, Inc. 

Full text available: html(10.17 KB) Additional Information: full citation, abstract, index terms
Linux Journal editors pick their favorites. 

86 Database systems — breaking out of the box
Avi Silberschatz, Stan Zdonik 

September 1997 ACM SIGMOD Record, volume 26 issue 3 
Publisher: ACM Press 

Full text available: pdf(1.23 MB) Additional Information: full citation, citings, index terms




87 An Internet-based negotiation server for e-commerce
Stanley Y.W. Su, Chunbo Huang, Joachim Hammer, Yihua Huang, Haifei Li, Liu Wang,
Youzhong Liu, Charnyote Pluempitiwiriyawej, Minsoo Lee, Herman Lam

August 2001 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 10 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(355.19 KB) Additional Information: full citation, abstract, citings, index terms

This paper describes the design and implementation of a replicable, Internet-based 
negotiation server for conducting bargaining-type negotiations between enterprises 
involved in e-commerce and e-business. Enterprises can be buyers and sellers of 
products/services or participants of a complex supply chain engaged in purchasing, 
planning, and scheduling. Multiple copies of our server can be installed to complement the 
services of Web servers. Each enterprise can install or select a trusted negotia ... 

Keywords: Constraint evaluation, Cost- benefit analysis, Database, E-commerce, 
Negotiation policy and strategy, Negotiation protocol 
November 2007 Proceedings of the ACM first Ph.D. workshop in CIKM PIKM '07
Publisher: ACM

Full text available: pdf(179.57 KB) Additional Information: full citation, abstract, references, index terms

Paraphrasing van Rijsbergen [37], the time is ripe for another attempt at using natural 
language processing (NLP) for information retrieval (IR). This paper introduces my 
dissertation study, which will explore methods for integrating modern NLP with state-of- 
the-art IR techniques. In addition to text, I will also apply retrieval to conversational 
speech data, which poses a unique set of considerations in comparison to text. Greater 
use of NLP has potential to improve both text and speech retr ... 

Keywords: information retrieval, natural language processing 



89 Document Databases: Requirements for XML document database systems 
Airi Salminen, Frank Wm. Tompa 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 
DocEng '01 

Publisher: ACM Press 

Full text available: pdf(141.99 KB) Additional Information: full citation, abstract, references, citings, index terms

The shift from SGML to XML has created new demands for managing structured 
documents. Many XML documents will be transient representations for the purpose of data 
exchange between different types of applications, but there will also be a need for 
effective means to manage persistent XML data as a database. In this paper we explore 
requirements for an XML database management system. The purpose of the paper is not 
to suggest a single type of system covering all necessary features. Instead the pur ... 

Keywords: XML, XML database systems, data definition, data manipulation, data 
modelling, structured documents 



90 Pervasive computing: what is it good for? 

Andrew C. Huang, Benjamin C. Ling, Shankar Ponnekanti
August 1999 Proceedings of the 1st ACM international workshop on Data engineering
for wireless and mobile access MobiDe '99
Publisher: ACM Press 

Full text available: pdf(897.82 KB) Additional Information: full citation, references, citings, index terms



91 Blueprint for a high performance NLP infrastructure
James R. Curran 

May 2003 Proceedings of the HLT-NAACL 2003 workshop on Software engineering 
and architecture of language technology systems - Volume 8 SEALTS '03 

Publisher: Association for Computational Linguistics 

Full text available: ^ pdf(92.57 KB) Additional Information: full citation , abstract , references 

Natural Language Processing (NLP) system developers face a number of new challenges. 
Interest is increasing for real-world systems that use NLP tools and techniques. The
quantity of text now available for training and processing is increasing dramatically. Also, 
the range of languages and tasks being researched continues to grow rapidly. Thus it is an 
ideal time to consider the development of new experimental frameworks. We describe the 
requirements, initial design and exploratory implementation ... 
Distance education in a global age: a perspective for internationalizing online
learning communities
Kirk St. Amant

January 2005 ACM SIGGROUP Bulletin, volume 25 issue 1
Publisher: ACM Press 

Full text available: ^ pdf(223.85 KB ) Additional Information: full citation , abstract , references , index terms 

As global online access grows, so does the prospect that online classes will contain 
students from various cultures and countries. Within this educational environment, cultural 
differences can affect how students share ideas, comprehend concepts, and access course- 
related materials. Effective instruction in international online classes therefore requires 
instructors to address those cultural differences that could most greatly affect learning in 
such environments. This essay provides an overview ... 

Keywords: communication, culture, international online learning environment, language, 
online access, resources, rhetoric, strategy, visual design 



93 Session 4: GenI: natural language generation in Haskell
Eric Kow 

September 2006 Proceedings of the 2006 ACM SIGPLAN workshop on Haskell Haskell 
'06 

Publisher: ACM Press 

Full text available: ^ pdf(973.28 KB) Additional Information: full citation , abstract , references , index terms 

In this article we present GenI, a chart based surface realisation tool implemented in
Haskell. GenI takes as input a set of first order terms (the input semantics) and a
grammar for a given target language (e.g., English, French, Spanish, etc.) and generates
sentences in the target language, whose semantic meaning corresponds to the input
semantics. The aim of the article is not so much to present GenI or to describe how it is
implemented. Rather, we will focus on the aspects of functional progr ... 

Keywords: Haskell, applications, computational linguistics, monads, profiling, realisation, 
surface, typeclasses 



94 Q focus: databases: Beyond relational databases 
Margo Seltzer 

April 2005 Queue, volume 3 issue 3 
Publisher: ACM Press 

Full text available: pdf(99.63 KB) Additional Information: full citation, abstract, references, index terms
There is more to data access than SQL. 

95 Overview of some patterns for architecting and managing composite web services 
B. Benatallah, M. Dumas, M.-C. Fauvet, F. A. Rabhi, Quan Z. Sheng 
June 2002 ACM SIGecom Exchanges, volume 3 issue 3 

Publisher: ACM Press 

Full text available: pdf(126.49 KB) Additional Information: full citation, abstract, references, citings, index terms

The composition of Web services has gained a considerable momentum as a paradigm for 
enabling Business-to-Business (B2B) Collaborations. Numerous technologies supporting 
this new paradigm are rapidly emerging, thereby creating a need for methodologies that 
bring these technologies together. The identification and documentation of relevant 
patterns, both at the analysis and design levels, is an important step in this direction. 

Keywords: B2B e-commerce, design patterns, web services 
96 Design and implementation of a distributed virtual machine for networked computers
Emin Gun Sirer, Robert Grimm, Arthur J. Gregory, Brian N. Bershad
December 1999 ACM SIGOPS Operating Systems Review, Proceedings of the
seventeenth ACM symposium on Operating systems principles SOSP
'99, Volume 33 Issue 5
Publisher: ACM Press 

Full text available: pdf(1.62 MB) Additional Information: full citation, abstract, references, citings, index terms

This paper describes the motivation, architecture and performance of a distributed virtual 
machine (DVM) for networked computers. DVMs rely on a distributed service architecture 
to meet the manageability, security and uniformity requirements of large, heterogeneous 
clusters of networked computers. In a DVM, system services, such as verification, security 
enforcement, compilation and optimization, are factored out of clients and located on 
powerful network servers. This partitioning of system fun ... 

97 Surfing the net for software engineering notes: Surfing the net for software
engineering notes
Mark Doernhoefer

September 2006 ACM SIGSOFT Software Engineering Notes, volume 31 issue 5

Publisher: ACM Press 

Full text available: pdf(1.15 MB) Additional Information: full citation, abstract, index terms

I mention many different free open source software (FOSS) products in this column in 
conjunction with the topics I select each month. This month, instead of focusing on a 
single topic, I thought I'd present a collection of some of the most popular free and open 
source software products currently in production in order to give you an impression on the 
size and scope of FOSS packages available. 

98 First European workshop on XML and knowledge management best papers: XML
and the future of humanities computing
Franco Niccolucci 

April 2002 ACM SIGAPP Applied Computing Review, volume 10 issue 1
Publisher: ACM Press 

Full text available: pdf(44.80 KB) Additional Information: full citation, abstract, references, citings, index terms

The existence of XML induces to hope that some limits of humanities computing may soon 
be trespassed. Here we mention some arguments concerning data management and 3D 
visualization, describing a few examples and test cases where the use of XML dramatically 
improved the quality of the application. These include text encoding, archaeological data 
management and Virtual Reality reconstruction of Cultural Heritage. 

Keywords: X3D, archaeological databases, cultural virtual reconstructions, medieval 
history 



99 Web 2.0 and accessibility: A web accessibility report card for top international
university web sites 

Shaun K. Kane, Jessie A. Shulman, Timothy J. Shockley, Richard E. Ladner 
May 2007 Proceedings of the 2007 international cross-disciplinary conference on
Web accessibility (W4A) W4A '07
Publisher: ACM Press 

Full text available: pdf(517.10 KB) Additional Information: full citation, abstract, references, index terms

University web pages play a central role in the activities of current and prospective
postsecondary students. University sites that are not accessible may exclude people with 
disabilities from participation in educational, social and professional activities. In order to 
assess the current state of university web site accessibility, we performed a multi-method 
analysis of the home pages of 100 top international universities. Each site was analyzed 
for compliance with accessibility standards, i ... 

Keywords: WCAG, accessibility, education, evaluation, section 508, web 

100 Integrating open hypermedia systems with the World Wide Web
Kenneth M. Anderson

April 1997 Proceedings of the eighth ACM conference on Hypertext HYPERTEXT '97
Publisher: ACM Press 

Full text available: pdf(1.00 MB) Additional Information: full citation, references, citings, index terms



Keywords: Chimera, World Wide Web, integration, open hypermedia systems 
101 Information retrieval 2: Learning metadata from the evidence in an on-line citation
matching scheme

Isaac G. Councill, Huajing Li, Ziming Zhuang, Sandip Debnath, Levent Bolelli, Wang Chien
Lee, Anand Sivasubramaniam, C. Lee Giles 

June 2006 Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries 
JCDL '06 

Publisher: ACM Press 

Full text available: pdf(431.06 KB) Additional Information: full citation, abstract, references, index terms

Citation matching, or the automatic grouping of bibliographic references that refer to the 
same document, is a data management problem faced by automatic digital libraries for 
scientific literature such as CiteSeer and Google Scholar. Although several solutions have 
been offered for citation matching in large bibliographic databases, these solutions 
typically require expensive batch clustering operations that must be run offline. Large 
digital libraries containing citation information can reduce ... 

Keywords: CiteSeer, bayesian inference, citation matching 

102 Posters: Multilingual distance learning for engineering
K. W. E. Cheng, K. F. Kwok 

November 2002 Companion of the 17th annual ACM SIGPLAN conference on Object-oriented
programming, systems, languages, and applications OOPSLA '02

Publisher: ACM Press

Full text available: pdf(369.01 KB) Additional Information: full citation, abstract, references

An advanced distance learning method is introduced that is especially programmed for 
non-native English speakers. Many language features have been implemented to arouse 
undergraduate students' interests in learning. The package is also programmed to help 
professors to teach much easier than before. 



Keywords: distance-learning, engineering, teaching and learning 



103 Video: Effects of audio and visual surrogates for making sense of digital video
Yaxiao Song, Gary Marchionini 

April 2007 Proceedings of the SIGCHI conference on Human factors in computing
systems CHI '07

Publisher: ACM Press 

Full text available: pdf(325.37 KB) Additional Information: full citation, abstract, references, index terms

Video surrogates are meant to help people quickly make sense of the content of a video 
before downloading or seeking more detailed information. In this paper we present the 
results of a study comparing the effectiveness of three different surrogates for objects in 
digital video libraries. Thirty-six people participated in a within subjects user study in 
which they did five tasks for each of three surrogate alternatives: visual alone (a 
storyboard), audio alone (spoken description), and combin ... 

Keywords: dual coding, multimedia, video surrogates 



104 Database and digital library technologies: Bulkloading and maintaining XML
documents

Albrecht Schmidt, Martin Kersten

March 2002 Proceedings of the 2002 ACM symposium on Applied computing SAC '02 

Publisher: ACM Press 

Full text available: pdf(555.23 KB) Additional Information: full citation, abstract, references, citings, index terms

The popularity of XML as an exchange and storage format brings about massive amounts of
documents to be stored, maintained and analyzed — a challenge that traditionally has 
been tackled with Database Management Systems (DBMS). To open up the content of XML 
documents to analysis with declarative query languages, efficient bulk loading techniques 
are necessary. Database technology has traditionally been offering support for these tasks 
but yet falls short of providing efficient automation techniqu ... 

Keywords: XML, document databases, document warehouses, maintenance, relational 
databases 



105 Workshop on compositional software architectures: workshop report
May 1998 ACM SIGSOFT Software Engineering Notes, volume 23 issue 3 
Publisher: ACM Press 

Full text available: pdf(2.91 MB) Additional Information: full citation, index terms




106 Technology to enable learning I: SVG for educational simulations
Daniel S. Bogaard, Ronald P. Vullo, Christopher D. Cascioli

October 2004 Proceedings of the 5th conference on Information technology 
education CITC5 '04 

Publisher: ACM Press 

Full text available: pdf(234.58 KB) Additional Information: full citation, abstract, references, index terms

Helping students to understand complex ideas will always be problematic for teaching 
professionals. Often, the students can be limited by not only their imagination, but by 
their experiences. When trying to explain something that is outside of the students' 
imagination, it is often helpful to have either simple animations or even interactive 
simulations that the students can explore. The creation of interactive environments and 
the use of animation can greatly help educators get their point a ... 

Keywords: animation, instructional software, scalable vector graphics, simulations 
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December 1999 netWorker, volume 3 issue 4 
Publisher: ACM Press

108 Building a hypertextual digital library in the humanities: a case study on London
Gregory Crane, David A. Smith, Clifford E. Wulfman 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital
libraries JCDL '01

Publisher: ACM Press 

Full text available: pdf(361.82 KB) Additional Information: full citation, abstract, references, citings, index terms

This paper describes the creation of a new humanities digital library collection: 11,000,000 
words and 10,000 images representing books, images and maps on pre-twentieth century 
London and its environs. The London collection contained far more dense and precise 
information than the materials from the Greco-Roman world on which we had previously 
concentrated. The London collection thus allowed us to explore new problems of data 
structure, manipulation, and visualization. This paper contrast ... 

Keywords: automatic linking, browsing, collection development, document design, 
reading 



109 Interacting with the WWW: Looking for convenient alternatives to forms for querying
remote databases on the Web: a new iconic interface for progressive queries

Fabrizio Capobianco, Mauro Mosconi, Lorenzo Pagnin

May 1996 Proceedings of the workshop on Advanced visual interfaces AVI '96 

Publisher: ACM Press 

Full text available: pdf(1.11 MB) Additional Information: full citation, abstract, references

The enormous popularity of the World Wide Web has made putting public access 
databases on the Web practically mandatory. Forms embedded within the Web clients 
(e.g. Netscape) are therefore emerging as the most common interfaces in database 
querying. Should this solution be considered completely satisfactory? We highlight some of
the important limits we experienced with forms and we propose a convenient alternative 
solution, based on direct manipulation of icons. The system we have developed is ea ... 

110 GeoName: a system for back-transliterating Pinyin place names
Kui Lam Kwok, Qiang Deng 

May 2003 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic
references - Volume 1
Publisher: Association for Computational Linguistics 

Full text available: pdf(119.91 KB) Additional Information: full citation, abstract, references

To be unambiguous about a Chinese geographic name represented in English text as 
Pinyin, one needs to recover the name in Chinese characters. We present our approach to 
this back-transliteration problem based on processes such as bilingual geographic name 
lookup, name suggestion using place name character and pair frequencies, and 
confirmation via a collection of monolingual names or the WWW. Evaluation shows that 
about 48% to 72% of the correct names can be recovered as the top candidate, and 8 ... 
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A Benchmark Suite for SOAP-based Communication in Grid Web Services 
Michael R. Head, Madhusudhan Govindaraju, Aleksander Slominski, Pu Liu, Nayef Abu- 
Ghazaleh, Robert van Engelen, Kenneth Chiu, Michael J. Lewis 

November 2005 Proceedings of the 2005 ACM/IEEE conference on Supercomputing SC '05

Publisher: IEEE Computer Society 

Full text available: pdf(337.08 KB) Additional Information: full citation, abstract, index terms

The convergence of Web services and grid computing has promoted SOAP, a widely used 
Web services protocol, into a prominent protocol for a wide variety of grid applications. 
These applications differ widely in the characteristics of their respective SOAP messages, 
and also in their performance requirements. To make the right decisions, an application 
developer must thus understand the complex dependencies between the SOAP 
implementation and the application. We propose a standard benchmark suite ... 

112 Performance: Clarifying the fundamentals of HTTP 
Jeffery C. Mogul 

May 2002 Proceedings of the 11th international conference on World Wide Web
WWW '02

Publisher: ACM Press 

Full text available: pdf(157.39 KB) Additional Information: full citation, abstract, references, citings, index terms

The simplicity of HTTP was a major factor in the success of the Web. However, as both the 
protocol and its uses have evolved, HTTP has grown complex. This complexity results in 
numerous problems, including confused implementors, interoperability failures, difficulty in 
extending the protocol, and a long specification without much documented rationale. Many 
of the problems with HTTP can be traced to unfortunate choices about fundamental 
definitions and models. This paper analyzes the current (HTTP ... 

Keywords: HTTP, protocol design 



113 Characterizing the memory behavior of Java workloads: a structured view and
opportunities for optimizations

Yefim Shuf, Mauricio J. Serrano, Manish Gupta, Jaswinder Pal Singh

June 2001 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 
2001 ACM SIGMETRICS international conference on Measurement and 
modeling of computer systems SIGMETRICS '01, volume 29 issue 1
Publisher: ACM Press 

Full text available: pdf(1.55 MB) Additional Information: full citation, abstract, references, citings

This paper studies the memory behavior of important Java workloads used in 
benchmarking Java Virtual Machines (JVMs), based on instrumentation of both application 
and library code in a state-of-the-art JVM, and provides structured information about 
these workloads to help guide systems' design. We begin by characterizing the inherent 
memory behavior of the benchmarks, such as information on the breakup of heap 
accesses among different categories and on the hotness of references to fields and met ... 

114 Document formatting: Using SVG as the rendering model for structured and
graphically complex web material

Julius C. Mong, David F. Brailsford

November 2003 Proceedings of the 2003 ACM symposium on Document engineering
DocEng '03
Publisher: ACM Press 

Full text available: pdf(124.14 KB) Additional Information: full citation, abstract, references, citings, index terms
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This paper reports some experiments in using SVG (Scalable Vector Graphics), rather than 
the browser default of (X)HTML/CSS, as a potential Web-based rendering technology, in
an attempt to create an approach that integrates the structural and display aspects of a 
Web document in a single XML-compliant envelope. Although the syntax of SVG is XML 
based, the semantics of the primitive graphic operations more closely resemble those of 
page description languages such as PostScript or PDF. The principa ... 

Keywords: PDF, SVG, XML, vector graphics 



115 PMML and UIMA based frameworks for deploying analytic applications and services

August 2006 Proceedings of the 4th international workshop on Data mining
standards, services and platforms DMSSP '06 

Publisher: ACM Press 

Full text available: pdf(339.74 KB) Additional Information: full citation, abstract, references

It is convenient to divide data into structured data, semi-structured data and unstructured 
data. By structured data, we mean data that is organized into fields or attributes. 
Examples include database records. Semi-structured data has attributes but does not have 
the regularity of structured data. Data defined by HTML or XML tags are examples of
semi-structured data. Unstructured data lacks attributes or fields and includes text data,
signals, images, video, audio or similar data. Of course, ... 

116 Research centers: Database research at the University of Illinois at Urbana-Champaign

M. Winslett, K. Chang, A. Doan, J. Han, C. Zhai, Y. Zhou
September 2002 ACM SIGMOD Record, volume 31 issue 3 

Publisher: ACM Press 

Full text available: pdf(668.38 KB) Additional Information: full citation , references 
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Unicode is becoming a dominant character representation format for information 
processing. This presents a very dangerous usability and security problem for many 
applications. The problem arises because many characters in the UCS (Universal Character 
Set) are visually and/or semantically similar to each other. This presents a mechanism for 
malicious people to carry out Unicode Attacks, which include spam attacks, phishing 
attacks, and web identity attacks. In this paper, we address the potential ... 
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This paper postulates that for the Semantic Web to grow and gain input from fields that 
will surely benefit it, it needs to develop an analogue that will help people not only 
understand what it is, but what the potential opportunities are that are enabled by these 
new protocols. The model proposed in the paper takes the way that Web interaction has 
been framed as a baseline to inform a similar analogue for the Semantic Web. While the 
Web has been represented as a Page + Links, the paper prese ... 
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In Spring 2003, Joe Hellerstein at Berkeley and Natassa Ailamaki at CMU collaborated in 
designing and running parallel editions of an undergraduate database course that exposed 
students to developing code in the core of a ful-function database system. As part of this 
exercise, our course teams developed new programming projects based on the 
PostgreSQL open-source DBMS. This report describes our experience with this effort. 
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In the modern Web, users are accessing their favourite Web applications from any place, 
at any time and with any device. In this setting, they expect the application to user-tailor 
and personalize content access upon their particular needs. Exhibiting some kind of user- 
and context-dependency is thus crucial in Web Engineering. In this research, we focus on 
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reseach community. In this paper we develop an XML Document Type Definition (DTD) for 
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representing the schema of a Role-based Access Control (RBAC) Model and a conforming 
XML document containing the actual RBAC-based access control data for a commercial 
banking application. Based on this DTD, the XML document and the methods ... 
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Database system designers have traditionally had trouble with the default services and 
interfaces provided by operating systems. In recent years, developers and enthusiasts 
have increasingly promoted Java as a serious platform for building data-intensive servers. 
Java provides a number of very helpful language features, as well as a full run-time 
environment reminiscent of a traditional operating system. This combination of features 
and community support raises the question of whether Java is be ... 

125 How to port Linux when the hardware turns soft 
David Lynch 

January 2007 Linux Journal, volume 2007 issue 153 
Publisher: Specialized Systems Consultants, Inc. 

Full text available: html(287.14 KB) Additional Information: full citation, abstract, index terms
Soul of the Pico machines 

126 24-hour knowledge factory: Using Internet technology to leverage spatial and
temporal separations 

Amar Gupta, Satwik Seshasai 

August 2007 ACM Transactions on Internet Technology (TOIT), volume 7 issue 3 
Publisher: ACM 

Full text available: pdf(589.35 KB) Additional Information: full citation, abstract, references, index terms

Several of the outsourcing endeavors of today will gradually converge to a hybrid 
outsourcing model that will involve a team spread across three or more strategically- 
located centers interconnected by Internet technology. White-collar professionals in the 
US, Australia, and Poland, for example, could each work on a standard 9-5 basis, transfer
the activity to a colleague in the next center, thereby enabling work to be performed on a 
round-the-clock basis. The effective use of sequential work ... 
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Any auxiliary structure, such as a bitmap or a B + -tree index, that refers to rows of a table 
stored as a primary B + -tree (e.g., tables with clustered index in Microsoft SQL Server, or 
index-organized tables in Oracle) by their physical addresses would require updates due to 
inherent volatility of those addresses. To address this problem, we propose a mapping 
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Any auxiliary structure, such as a bitmap or a B + -tree index, that refers to rows of a table 
stored as a primary B + -tree (e.g., tables with clustered index in Microsoft SQL Server, or 
index-organized tables in Oracle) by their physical addresses would require updates due to 
inherent volatility of those addresses. To address this problem, we propose a mapping 
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There is a general consensus that Ada would benefit from having a library of tools 
available that would be standard across most or all implementations. This paper discusses 
how to get a "standard" library for Ada without impacting the ARM, increasing the
leverage available to Ada developers, providing an enhanced product for the Ada compiler 
vendors, sparking new interest in Ada and perhaps building a financially successful 
business in the process. 
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The common vision of pervasive computing environments requires a very large range of 
devices and software components to interoperate seamlessly. From the assumption that 
these devices and associated software permeate the fabric of everyday life, a massive 
increase looms in the number of software developers deploying functionality into pervasive 
computing environments. This poses a very large interoperability problem for which 
solutions reliant solely on interoperability standards will not scale. ... 
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Vanishing Point is a presentation of the world as it responds to international newspaper 
coverage - not a measure of what the world is, but of what is most newsworthy. 
Consequently, countries that receive less media coverage gradually disappear from view. 
It consists of an interactive world map connected to a database fed by international news 
sources, and exists both in the form of a website (http://low-fi.org.uk/vanishingpoint) and 
as a physical gallery installation. The goal of this pie ...
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Deciding what to teach novices about programming and what programming language to 
use is a common topic for debate. Should an industry relevant programming language be 
taught, or should a language designed for teaching novices be used? Typically, these 
questions are raised at university level, but in this paper we address them from a high 
school perspective. We present a case study with a twofold goal: (1) examining how 
programming can be introduced at high school level, and (2) evaluating how su ... 
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uncover the best tools for tracking down facts, news, 
people, music, and more. 

pcworld.about.com/magazine/1909p109id55383.htm
[Found on About] 

Search Engines 

Individual Search Engines | Meta Search Engines .... 
allows any type of search syntax and will translate and 
direct your search accordingly ... 

www.internettutorials.net/engines.html [Found on Google]

Class SearchEngineCornell
... translate(java.lang.String query, java.lang.String
method) ... Translate the query to allow the boolean 
expression ... 

www.dcs.warwick.ac.uk/-websearch/doc/uk/ac/warwick... 
[Found on Ask.com] 

Time-Saving Tips From the Pros 
Our five experts share their favorite secrets for working 
faster, not harder: from file management shortcuts to
photo editing tricks. 

pcworld.about.com/magazine/2111p104id112466.htm
[Found on About] 

Language and Linguistics

Research Language and Linguistics at the Questia
online library.

Sponsored by: www.questia.com 

Launch Page - Lakewood Public Library 
Google Help | [Advanced Search] | Translate Also 
search: Gov't. ... METASEARCH ENGINE (Search 
Multiple Engines at Once) ... 

www.lkwdpl.org/lpl/ [Found on Ask.com] 

Ask Computer Questions 

Engineers, Tech Support & Other Computer Experts help 
online asap! 

Sponsored by: www.JustAnswer.com 
[Found on Ads by Ask.com] 



Find Online Deals 

Looking for metasearch format translate search? We 
have them. 

Sponsored by: www.FindStuff.com 
[Found on Ads by Ask.com]

10. The Hidden side of MetaSearch

Metasearch is: □ Single user query, multi-source
search. □ On-demand, user driven, real time. □ All about 
personal choice. □ Multi-protocol, multi-format ... 

www.infonortics.com/searchengines/sh06/slides/noer...
[Found on Google] 

11. Site > Blog | | Fagan Finder 

My seriously out-of-date Meta Search Engines page is 
now up to date. ... Big new feature today, the Fagan 
Finder Translation Wizard goes live in beta status ... 

www.faganfinder.com/blog/ [Found on Google] 

12. Search Engine Watch Blog: May 29, 2005 -
June 04, 2005 Archives
SES San Jose: Aug. 7-10, 2006 A- SES Local Search 
Edition - Denver: Sept. ... below: SearchDay (Daily) 
Search Engine Report (Monthly) ... 

blog.searchenginewatch.com/blog/050529-week 
[Found on Ask.com] 

13. sctcpz 

Pat2PDF Convert patent documents to PDF file format 
for easy viewing... 2 November 2005 

searchsiicks.port5.com/rfrnce/sctc_p__z.html 
[Found on Ask.com] 

14. Search Engine Terms - Information for
Translators

Meta Search Engine Netfind Page Popularity 
Positioning ... If you would like to translate the l-Search 
glossary into your mother tongue then please contact ... 

www.cadenza.org/search_engine_terms/updates.htm
[Found on Google] 

15. Search Update 

... the first of the search engines to translate Adobe 
PDF (portable document format) files into HTML and 
index them across the ... 

joycevalenza.com/searchupart.html [Found on Ask.com]

16. Implementing a customised meta-search
interface for user query ...

The query-translation problem. A meta-search engine
submits queries over multiple search engines stored
in a unified format for further process. ...

ieeexplore.ieee.org/iel5/7968/22030/01024655.pdf
[Found on Google]

17. Adaptive Data Model for Meta Search Engines 
Some results are very useful for constructing a meta 
search engine. But they do not consider the following 
problem: The translation of a query will be ... 

www9.org/w9cdrom/165/165.html [Found on Google]

18. Srch.org - Search the top 6 Search Engines at
the same time

WebCrawler Web Search Home Page Official home of
the WebCrawler metasearch engine. ... tools: Search,
research, reference, look up or ...

www.srch.org/ [Found on Ask.com]

19. Distributed Information Search with Adaptive 
Meta-Search Engines 

kinds of differences make query translation from a meta- 
search engine to a .... and translates the results of the 
data source into the format that can be ... 

www.springerlink.com/index/5vq6u8agx1292drv.pdf 
[Found on Google]

20. Knowledge Management and the Internet -
Graziadio Business Report
Table 1 Search Engines with Advanced Features ... 
power search commands that the software will translate 
for each search service, 

gbr.pepperdine.edu/001/search.html [Found on Ask.com]







© 2007 InfoSpace, Inc. All Rights Reserved



http://www.dogpile.com/dogpile/ws/res



11/29/07 



Metasearch Format Translate Search - Dogpile Web Search 



Page 1 of 2 






Web Search Results for "metasearch format translate search" 

Now Searching: Google, Yahoo! Search, Live Search

21 - 26 of 26 from All Search Engines (About Results)






21. Adaptive Web Meta-Search Enhanced by
Constraint-based Query ...
input interface construction with the constraints-based
query translation method. Our meta-search engine
prototype dynamically generates the query input ...

www.springerlink.com/index/9wb0fejynecjdh3u.pdf
[Found on Google]

22. FREE GLOBAL GATEWAY MASTER
ENHANCED NAVIGATOR SEARCH ENGINE
If you want to search a portion of the page, first click it
with the mouse. ... Consider cutting and pasting the
same search string you used ...

www.geocities.com/humble3d/40.html [Found on Ask.com] 

23. Integration of Job Portals by Meta-search 
into one meta-search engine. First, existing job portals 
were investigated and XML schemes were derived
automated from these portals. Second, translation ... 

move. ec3. at/Papers/ Job Mela search.pdf [Found on Google] 

24. Search Engines [CiteSeer; NEC Research
Institute; Steve Lawrence. ...
... pages with Learning Search Engine Specific Query
Transformations for Question Learning Search
Engine Specific Query ...

citeseer.ist.psu.edu/WorldWideWeb/SearchEngines/da... 
[Found on Ask.com] 

25. ALA |
SRU specifies a standard query grammar: CQL
(Common Query Language). This means that the meta-
search engine only has to write one translator for all
the ...

www.lita.org/ala/lita/litapublications/ital/252006...
[Found on Google]

26. CIS 101 Search Page

Translate Advanced Search Settings Maps More ... all 
search engines also. Bookmark Searches. Metasearch 
ALL search engines with ... 

www.lcc.ctc.edu/faculty/drosi/101/search.xtm 
[Found on Ask.com] 
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